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Status of Claims 

Claims 1-22 are under final rejection, and are each appealed. 
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Status of Amendments after Final 

No claims were amended after final rejection. 
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SUMMARY OF CLAIMED SUBJECT MATTER 

The following summary refers to disclosed embodiments and their advantages, but does not delimit 
any of the claimed inventions. 

In General 

The present application is directed, in general, to data processors and, more specifically, to a 
data processor capable of executing load operations having different latencies. Page 3, lines 7-9. 
Support for Independent Claims 

Note that, per 37 CFR §41.37, only each of the independent claims are discussed in this 
section, as well as any claims including means-plus-function language that is argued separately 
below. In the arguments below, however, the dependent claims are also discussed and distinguished 
from the prior art. The discussion of the claims is for illustrative purposes, and is not intended to 
effect the scope of the claims. 

Regarding Claim 1 , a data processor 100 includes an instruction pipeline 400. {Application, 
Page 24, Lines 20-22). The pipeline 400 includes N processing stages 401-407. (Application, Page 
25, Lines 1-4). Each of the processing stages 401-407 performs an execution step associated with a 
pending instruction being executed by the pipeline 400. {Application, Page 25, Line 5 - Page 28, 
Line #). A data cache 330 is capable of storing data values used by the pending instruction. 
(Application, Page 11, Lines 1 1-12; Page 28, Lines 13-19). A plurality ofregisters 3 10 is capable of 
receiving the data values from the data cache 330. (Application, Page 1 1, Lines 13-14; Page 27, Line 



Appeal Brief- Serial No. 09/751,372 



Page 7 



Attorney Docket No. 00-BN-051 (STMI01-00051) 
U.S. Serial No. 09/751,372 
Patent 

22 - Page 28, Line 2). A load store unit 325 is capable of transferring a first data value from the data 
cache 330 to a target register during execution of a load operation. (Application, Page 29, Line 8 - 
Page 30, Line 2). A shifter circuit 535 is capable of shifting, sign extending, or zero extending the 
first data value prior to loading the first data value into the target register. (Application, Page 29, 
Lines 1 7-22). Bypass circuitry 530 is capable of transferring the first data value from the data cache 
330 directly to the target register without processing the first data value in the shifter circuit 535. 
(Application, Page 28, Line 20 - Page 29, Line 7). 

Regarding Claim 1 0, a method 600 for performing load operations in a data processor 100 is 
provided. (Application, Page 29, Lines 8-10). The method 600 includes determining if a pending 
instruction in an N-stage execution pipeline 400 is one of a load word operation, a load half-word 
operation, and a load byte operation. (Application, Page 29, Lines 13-22). If the pending instruction 
is a load half-word operation or a load byte operation, the method 600 includes transferring a first 
data value from a data cache 330 to a shifter circuit 535 and shifting the first data value prior to 
loading the first data value into a target register. (Application, Page 29, Lines 10-13 and 1 7-22). If 
the pending instruction is a load word operation, the method 600 includes transferring the first data 
value from the data cache 330 directly to the target register without processing the first data value in 
the shifter circuit 535. (Application, Page 29, Lines 13-17). 

Regarding Claim 14, a processing system 10 includes a data processor 100, a memory 130, 
and a plurality of memory-mapped peripheral circuits 111-114 for performing selected functions. 
(Application, Page 17, Lines 9-17). The data processor 400 includes an instruction execution 
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pipeline 400 having N processing stages 401-407, each of which is capable of performing an 
execution step associated with a pending instruction being executed. (Application, Page 24, Line 20 
- Page 28, Line 8). The data processor 400 also includes a data cache 330 capable of storing data 
values used by the pending instruction. (Application, Page 11, Lines 11-12; Page 28, Lines 13-19). 
The data processor 400 further includes a plurality of registers 310 capable of receiving the data 
values from the data cache 330. (Application, Page 11, Lines 13-14; Page 27, Line 22 - Page 28, 
Line 2). The data processor 400 also includes a load store unit 325 capable of transferring a first data 
value from the data cache 330 to a target register during execution of a load operation. (Application, 
Page 29, Line 8 - Page 30, Line 2). The data processor 400 further includes a shifter circuit 535 
capable of shifting, sign extending, or zero extending the first data value prior to loading the first 
data value into the target register. (Application, Page 29, Lines 17-22). In addition, the data 
processor 400 includes bypass circuitry 530 capable of transferring the first data value from the data 
cache 330 directly to the target register without processing the first data value in the shifter circuit 
535. (Application, Page 28 s Line 20 - Page 29, Line 7). 

With regard to claim 23, a data processor 400 also a data cache 330. (Application, Page 11, 
Lines 7-12). The data processor 400 further includes a plurality of registers 3 1 0 (Application, Page 
11). The data processor 400 further includes a shifter circuit 535 capable of shifting, sign extending, 
or zero extending the first data value prior to loading the first data value into the target register. 
(Application, Page 29, Lines 1 7-22). In addition, the data processor 400 includes bypass circuitry 
530 capable of transferring the first data value from the data cache 330 directly to the target register 
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without processing the first data value in the shifter circuit 535. (Application, Page 28, Line 20 - 
Page 29, Line 7). 

With regard to claim 27, a method includes shifting, sign extending, or zero extending a first 
data value from a cache and providing a modified first data value to a first of a plurality of registers. 
(Application, Page 29, Line 10 -Page 30, Line 2). The method 600 includes transferring a second 
data value from the data cache 330 directly to the target register without shifting, sign extending, or 
zero extending the second data value. (Application, Page 29, Lines 13-17). 

With regard to claim 29, a system 1 0 includes a processor 400. (Application, Page 1 7, Lines 
9-17). The processor 400 includes a a cache 300. . (Application, Page 11, Lines 11-12; Page 28, 
Lines 13-19). The processor 400 includes a plurality of registers 310. (Application, Page 11, Lines 
13-14; Page 27, Line 22 - Page 28, Line 2). The processor 400 includes a shifter circuit535 capable 
of shifting, sign extending, or zero extending a data value from the cache and providing a modified 
data value to a target one of the registers (Application, Page 29, Lines 17-22). The processor 400 
includes a bypass circuit capable of transferring the data value from the cache to the target register 
without processing the data value in the shifter circuit (Application, Page 28, Line 20- Page 29, 
Line 7). The system 10 includes a memory coupled 130 to the processor. (Application, Page 17, 
Lines 9-17). The system 10 includes a plurality of peripheral circuits 1 1 1-1 14 capable of performing 
selected functions in association with the processor. (Application, Page 17, Lines 9-17). 
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Grounds of Rejection to be Reviewed on Appeal 

1. Are Claims 1-22 under 35 U.S.C. § 103(a) as being unpatentable over U.S. Patent No. 

5.761.469 to Greenlev et al. ("Greenlev") in view of U.S. Patent No. 
5.706.481 to Hannah, et al. ("Hannah")? 
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ARGUMENT 

Stated Grounds of Rejection 

The rejections outstanding against the Claims are as follows: 

1 . In the December 29, 2006 Office Action, Claims 1 -29 were rejected under 35 U.S.C. 
§ 103(a) as being unpatentable over U.S. Patent No. 5,761,469 to Greenley et al. ("Greenley") in 
view of U.S. Patent No. 5,706,481 to Hannah, et al. ("Hannah"). 

Legal Standards 

Obviousness: In ex parte examination of patent applications, the Patent Office bears the 
burden of establishing a prima facie case of obviousness. (MPEP § 2142; In re Fritch, 972 F.2d 
1260, 1262, 23 U.S.P.Q.2d 1780, 1783 (Fed. Cir. 1992). The initial burden of establishing a prima 
facie basis to deny patentability to a claimed invention is always upon the Patent Office. (MPEP § 
2142; InreOetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444(Fed. Cir. 1992; Inre Piasecki, 
745 F.2d 1468, 1472, 223 U.S.P.Q. 785, 788 (Fed. Cir. 1984)). Only when a prima facie case of 
obviousness is established does the burden shift to the Applicant to produce evidence of 
nonobviousness. (MPEP § 2142; In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 
(Fed. Cir. 1992; In re Rijckaert, 9F.3dl531, 1532, 28 U.S.P.Q.2d 1955, 1956 (Fed. Cir. 1993). If 
the Patent Office does not produce a prima facie case of unpatentability, then without more the 
Applicant is entitled to grant of a patent. (In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 
1444 (Fed. Cir. 1992; In re Grabiak, 769 F.2d 729, 733, 226 U.S.P.Q. 870, 873 (Fed. Cir. 1985). 
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A prima facie case of obviousness is established when the teachings of the prior art itself 
suggest the claimed subject matter to a person of ordinary skill in the art. (In re Bell, 991 F.2d 781, 
783, 26 U.S.P.Q.2d 1529, 1531 (Fed Cir. 1993). To establish & prima facie case of obviousness, 
three basic criteria must be met. First, there must be some suggestion or motivation, either in the 
references themselves or in the knowledge generally available to one of ordinary skill in the art, to 
modify the reference or to combine reference teachings. Second, there must be a reasonable 
expectation of success. Finally, the prior art reference (or references when combined) must teach or 
suggest all the claim limitations. The teaching or suggestion to make the claimed invention and the 
reasonable expectation of success must both be found in the prior art, and not based on the 
Applicant's disclosure. (MPEP § 2142). 

In order to establish obviousness by combining references there must be some teaching or 
suggestion in the prior art to combine the references. Arkie Lures, Inc. v. Gene Larew Tackle, Inc., 
119 F.3d 953, 957, 43 USPQ2d 1294, 1297 (Fed.Cir. 1997) ("It is insufficient to establish 
obviousness that the separate elements of an invention existed in the prior art, absent some teaching 
or suggestion, in the prior art, to combine the references."); re Rouffet, 149 F.3d 1350, 1355-56, 
47 USPQ2d 1453, 1456 (Fed.Cir. 1998) ("When a rejection depends on a combination of prior art 
references, there must be some teaching, or motivation to combine the references.") 

Evidence of a motivation to combine prior art references must be clear and particular if the 
trap of "hindsight" is to be avoided. In re Dembiczak, 175 F.3d 994, 50 USPQ2d 1614 (Fed.Cir. 
1999) (Evidence of a suggestion, teaching or motivation to combine prior art references must be 



Appeal Brief- Serial No. 09/751,372 



Page 13 



Attorney Docket No. 00-BN-051 (STMI01-00051) 
U.S. Serial No. 09/751,372 
Patent 

"clear and particular." "Broad conclusory statements regarding the teaching of multiple references, 
standing alone, are not 'evidence.'"). In re Roufett, 149 F.3d 1350, 1357, 47 USPQ2d 1453, 1457 
(Fed.Cir. 1 998) ("[Rejecting patents solely by finding prior art corollaries for the claimed elements 
would permit an examiner to use the claimed invention itself as a blueprint for piecing together 
elements in the prior art to defeat the patentability of the claimed invention. Such an approach would 
be 'an illogical and inappropriate process by which to determine patentability.'") 

Analysis of Examiner's Rejection 
First Ground of Rejection 

Claims 1-22 were rejected under 35 U.S.C. § 103(a) as being unpatentable over U.S. Patent 
No. 5,761,469 to Greenley et al ("Greenley") in view of U.S. Patent No. 5,706,481 to Hannah et 
al ("Hannah"). 

Greenley recites a method and apparatus for improving the performance of pipelined 
processors. {Abstract). The apparatus of Greenley includes a data cache 180, an aligning unit 170, 
and a sign extension unit 160. {Figure 7). The aligning unit 170 aligns data retrieved from the data 
cache 180, and the sign extension unit 160 extends the sign of the retrieved data. {Col 2, Lines 32- 
54). In particular, the aligning unit 1 70 right justifies the data retrieved from the data cache 1 80, and 
the sign extension unit 160 fills unoccupied bits with sign information. {Col 2, Lines 36-50). 

Hannah describes an apparatus and method for integrating texture memory and interpolation 
logic. Hannah describes "interpolator chains" in which certain portions of the "slices" can be 
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disabled. (Col. 9,Llines 30-67). 
Claim 1 

Independent claim 1 describes 

1 . A data processor comprising: 

an instruction execution pipeline comprising N processing 
stages, each of said N processing stages capable of performing one of 
a plurality of execution steps associated with a pending instruction 
being executed by said instruction execution pipeline; 

a data cache capable of storing data values used by said 
pending instruction; 

a plurality of registers capable of receiving said data values 
from said data cache; 

a load store unit capable of transferring a first one of said data 
values from said data cache to a target one of said plurality of 
registers during execution of a load operation; 

a shifter circuit associated with said load store unit capable of 
one of a) shifting, b) sign extending, or c) zero extending said first 
data value prior to loading said first data value into said target 
register; and 
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bypass circuitry associated with said load store unit capable of 
transferring said first data value from said data cache directly to said 
target register without processing said first data value in said shifter 
circuit. 

Claim 1 requires a "load store unit" capable of transferring a first data value from a "data 
cache" to a "target one of [a] plurality of registers" during execution of a load operation. Claim 1 
also recites a "shifter circuit" capable of shifting, sign extending, or zero extending the first data 
value "prior to loading [the] first data value into [the] target register " In addition, Claims 1 and 14 
recite "bypass circuitry" capable of "transferring [the] first data value from [the] data cache directly 
to [the] target register without processing [the] first data value in [the] shifter circuit." 

Claims 1 is crystal clear - the "shifter circuit" can process a data value before the data value 
is loaded from a data cache into a target register. Also, the "bypass circuitry" can transfer the data 
value from the data cache directly to the target register without the data value being processed by the 
"shifter circuit" (thereby bypassing the shifter circuit). 
Examiner Li alleges that Greenley teaches 

A shifter circuit (Greenley 160,170 of Fig. 1) associated with said 
load store unit capable of one of a) shifting (Greenley Col.2 lines 19- 
31), b) sign extending (Greenley Col. 2 lines 48-54), and c) zero 
extending (Greenley Col. 2 lines 45-47) said first data value prior to 
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loading said first data value into said target register; 
Examiner Li is incorrect; Greenley actually teaches, in col. 2, lines 19-54: 

Since needed data may not be physically stored consecutively 
in data cache 1 80, a LOAD access to data cache 1 80 must insure that 
the accessed data is aligned (i.e. both the upper and lower half of a 
word is fetched) into an appropriate format to write into a register in 
the register file 150. For example, when accessing a half-word from 
the data cache 180, the alignment unit 170 assures that the least 
significant bit of the half-word is in the least significant bit location in 
the register and the most significant bit of the half-word is in the bit 
position 15 in the register. After the word is aligned, it may be sign 
extended if the sign of the accessed word is ascertained during a 
signed load operation. 

Referring to FIG. 2, a detailed block diagram of the alignment 
unit 170 and the sign extension unit 160 coupled between the data 
cache 180 and a target register in the register file 150 is shown. The 
purpose of the alignment unit 170 is to right justify the data fetched 
from the data cache 180. For example, when a byte is fetched from 
the data cache 1 80, regardless of its byte position (i.e., 0 through 7) in 
the cache, the byte is always right justified in the alignment unit. 
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Similarly, a half word is also right justified in the alignment unit 170, 
regardless of what position the half word occupied in the cache 180, 
or even if the two bytes of the half word were present in two different 
cache lines in the data cache 1 80. Words, double words, and extended 
words are similarly aligned by the alignment unit 1 70. In an unsigned 
load operation, the rest of the register not loaded with actual data is 
zero filled. 

The purpose of the sign extension unit 1 60 is to fill in the 
unoccupied bits of a register with sign information indicative of the 
sign of the data loaded from the data cache 1 80. The sign of the data 
is determined by its most significant bit ("MSB"). For example, if the 
MSB of the data is a "0", the data is considered positive. On the other 
hand, if the MSB is a " 1 ", then the data is considered negative. 



Examiner Li therefore claims that sign extension unit 160 and alignment unit 170 together 
form the claimed "shifter circuit." Examiner Li is correct in that sign extension unit 160 does 
perform sign extending, and the alignment unit 1 70 does perform zero filling and shifting. In fact, 
Greenley teaches that the alignment must be assured, and that alignment unit 170 assures that the 
appropriate bits of the half-words are in the appropriate bit positions. See col. 2, lines 19-30. It is 
clear from this that, in Greenley's system, that the "shifter circuit" of 1 60/1 70 cannot be bypassed, as 
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those functions must be present. 

Examiner Li's position is therefore contradicted by the express recitations of Greenley. 
Regarding the aligning unit, Greenley specifically recites that load access to a data cache "must" 
insure that the accessed data is aligned. {Col 2, Lines 20-25), Greenley also specifically recites that 
"double words" are "similarly aligned by the aligning unit." Regarding the sign extension unit, 
Greenley specifically recites that "double-words" are "similarly sign extended" by "the sign 
extension unit." {Col 2, Lines 60-61). 

It is clear, then, that Greenley does not, and cannot, include "bypass circuitry" as claimed, 
and Greenley fails to disclose, teach, or suggest a structure that allows a data value to be transferred 
either (i) from a data cache to a target register through a shifter circuit, or (ii) directly from the data 
cache to the target register while bypassing the shifter circuit, as conceded in the Office Action. 

In Greenley, all data passes from the data cache 180 through the aligning unit 170 and the 
sign extension unit 160 into the register files 150. Because of this, Greenley recites that all data 
passes through a shifter. More specifically, Greenley recites that all data passes through a shifter (the 
alignment unit 170 and the sign extension unit 160) before the data is stored in a register file 150. 

Examiner Li instead references Hannah, and alleges that "Hannah has taught bypassing 
functions circuitry capable of transferring said first data value to said target without processing said 
first data value in said shifter circuit (Hannah column 9, lines 3 1-67; Figure 11; Figure 12, Figure 13, 
and Figure 14)." This passage actually teaches: 
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FIG. 1 1 shows three examples of different configurations that can be 
used for the interpolator chains in the first configuration, four of the 
4-bit unit 1101 can be stacked to perform 16-bit tri-linear 
interpolations. In the second configuration, a pair of the 8-bit unit 
1 102 can be stacked to perform the 16-bit tri-linear interpolations. In 
the third configuration, a single 12-bit unit 1 103 is used to perform 
the interpolations. A number of multiplexers are programmed to 
couple the various units in order to achieve the desired configuration. 
In this manner, the same hardware can be utilized with minimal extra 
gate count to attain the flexibility in data width. In some applications, 
greater precision and resolution are desirable, whereas, in other 
applications, speed and cost are of greater importance. 

In the currently preferred embodiment, the units 1101-1103 are 
comprised of an S-stage, a T-stage, and an LOD-stage. The S-stages 
are comprised of 4-bit slices 1 104. The T-stages and LOD Stages are 
comprised of 6-bit slices 1 105. These slices are designed to make the 
interpolators modular. FIG. 12 shows a basic 4-bit input interpolation 
slice. 
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The data to the muxes comes from the subtractor (not shown). The 
most significant (MS) bits are either sign extension (if the slice is 
being used as the MS slice) or are the low order bits from the next 
more significant slice (when not the most significant slice). Since 
each slice handles both a sign bit and its extension, as well as fraction 
bits, these capabilities are disabled or bypassed as appropriate for the 
position of the slice in the larger word. For example, for 8-bit texels, 
the MS slice would handle the sign bit (and no fraction bits) and the 
least significant (IS) slice would handle the fraction bits (but no sign). 
The slice configuration muxes separately enable or disable the slices 
MS portion (sign extension) or LS portion (fraction). FIG. 13 shows 
an 8-bit input interpolator, which is formed by coupling together two 
4-bit slices. FIG. 14 shows a basic 6-bit input interpolator slice. 



As is clear, at no point in this passage does Hannah teach or suggest bypass circuitry, as 
claimed, that is capable of transferring a first data value from a data cache, as no data cache is shown 
or described here. One cache described by Hannah is read cache 208, shown in Figure 2, which 
contains mip maps used by interpolator 209. Interpolator 209 resamples the texture image to 
produce the output samples, which are sent to the graphics rasterizer. These mip maps are not data 
values used by the pending instructions in an instruction execution pipeline, to transferred into a 
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target register, as required by the claims. Nor do the TRAM caches meet the limitations of the 
claims, nor the I/O caches (e.g. 407). 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring a first data value directly to a 
target register, as no registers are shown or described here. Hanna does not include any bypass 
circuitry that transfers any data values from a data cache to a register. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring the first data value without 
processing the first data value in a shifter circuit. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is associated with a load store unit. 

Examiner Li does not even attempt to show where these limitations may be found within 
Hannah or these figures, and fails to allege even a single specific element that could meet the claim 
limitations. Indeed, this passage of Hannah appears to be cited simply because it includes the term 
"bypassed". The particular sentence of Hannah the Examiner reiterates in response is "Since each 
slice handles both a sign bit and its extension, as well as fraction bits, these capabilities are disabled 
or bypassed as appropriate for the position of the slice in the larger word." The Examiner then 
acknowledges that Hannah doesn't actually teach any bypass circuitry, stating: 

Hannah also teaches in column 9, lines 37-44 that his implementation 
of the multiplexers he minimizes gate count, e.g. reduce the size of 
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the device, and attains "flexibility in data width." Hannah then 
proceeds to state that "In some applications greater precision and 
resolution are desirable, whereas, in other applications, speed and cost 
are of greater importance." This suggests that how the multiplexers 
are used, e.g. to bypass certain capabilities, dependent on the 
designers desire. Bypassing effectively disables a functionality, such 
as the sign extension mentioned in both Greenley and Hannah, so that 
it is not performed and time is not wasted on an unneeded 
functionality. Consequently, the speed is increased in the system. 
The Examiner does not - and tacitly admits that she cannot - identify any specific circuit in 
Hannah that is a bypass circuitry associated with a load store unit capable of transferring a first data 
value from a data cache directly to a target register without processing the first data value in a shifter 
circuit, as claimed. To the extent anything in Hannah is bypassed at all, there is no "bypass circuitry" 
connected as claimed or capable of performing the claimed functions. 

Examiner Li alleges that "Hannah has taught bypassing functions circuitry capable of 
transferring said first data value to said target without processing said first data value in said shifter 
circuit." Examiner Li is incorrect - no data cache is shown or described here, as required by the 
claims, and as Examiner Li attempts to ignore by simply referencing a "target". One cache 
described by Hannah is read cache 208, shown in Figure 2, which contains mip maps used by 
interpolator 209. Interpolator 209 resamples the texture image to produce the output samples, which 
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are sent to the graphics rasterizer. These mip maps are not data values used by the pending 
instructions in an instruction execution pipeline, to transfer into a target register, as required by the 
claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O caches (e.g. 407). 

Examiner Li therefore fails to make a prima facie obviousness rejection. Even if there were 
motivation to combine these references, the proposed combination still does not include the claim 
limitations. 

Since Greenley does not teach or suggest the claimed features, and Hannah does not teach or 
suggest the claimed features, no combination of these references can teach or suggest the claimed 
features. As these features are found in all independent claims, all claims distinguish over Greenley, 
Hannah, and any combination of them. Greenley, in fact, teaches away from the Examiner's 
proposed combination and further modification by teaching that the shift functions must be used for 
any data transfer. 

Applicant further notes that the Examiner Li's stated motivation is that "bypasses improve 
the performance of a system by minimizing delays from unnecessary functions (Hannah column 9, 
lines 38-44)". This portion of Hannah is part of the passage reproduced above, and it is clear that 
there is no such teaching in this passage. While Hannah teaches a trade-off between 
precision/resolution and speed/cost, this has nothing to do with any bypass circuitry, despite the 
Examiner's rhetorical bootstrapping. There is no proper motivation in the reference itself, explicitly 
or implicitly, nor in the knowledge of one of ordinary skill in the art, nor in the problem to be solved. 

The rejection of this claim should be reversed. 
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Claim 2 

Claim 2 requires, among other limitations, that "said bypass circuitry transfers said first data 
value from said data cache directly to said target register during a load word operation," 

This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 2 depends from claim 1 ; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. 

Examiner Li does not make any attempt to show this limitation in the art , and so fails to make 
a prima facie rejection. The Examiner simply asserts that "Greenley's loading of a double word has 
the same consequences as applicant's loading of a word." Even if this were true, it is irrelevant -the 
Examiner is required to show that the claim limitation itself is taught in some reference, and the 
Examiner cannot. 

As Greenley doesn't even include bypass circuitry, as the Examiner admits, the Examiner's 
reliance on Greeley for a limitation relating to the operation of the bypass circuitry is particularly 
empty. This particular transfer operation is not taught or suggested by any art of record, and certainly 
is not taught as being performed during a load word operation. 

The rejection of this claim should be reversed. 
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Claim 3 

Claim 3 requires, among other limitations, that "bypass circuitry transfers said first data value 
from said data cache directly to said target register at the end of two machine cycles." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 3 depends from claim 2; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner again relies on Greenley to teach this limitation. As Greenley doesn't even 
include bypass circuitry, as the Examiner admits, the Examiner's reliance on Greeley for a limitation 
relating to the operation of the bypass circuitry is particularly empty. This particular transfer 
operation is not taught or suggested by any art of record, and certainly is not taught as being 
performed during a load word operation, and at the end of two machine cycles. 

The rejection of this claim should be reversed. 

Claim 4 

Claim 4 requires, among other limitations, that "shifter circuit one of a) shifts, b) sign 
extends, or c) zero extends said first data value prior to loading said first data value into said target 
register during a load half-word operation." 

This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 
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Claim 4 depends from claim 1; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. The rejection of this claim 
should be reversed. 

Claim 5 

Claim 5 requires, among other limitations, that the "shifter circuit loads said shifted first data 
value into said target register at the end of three machine cycles." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 5 depends from claim 4; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner makes no showing in the art of this limitation. Instead, the Examiner 
hypothesizes what latency savings might occur in the proposed combination, without making any 
reference to the teachings of the art itself, and then indicates that the claim limitation is simply a 
"change in magnitude of latency" over a hypothesis based on a proposed combination. The 
limitation is not taught or suggested by the art of record, alone or in combination, and the Examiner 
cannot make a showing of any actual relevant teaching in the art. 

The rejection of this claim should be reversed. 
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Claim 6 

Claim 6 requires, among other limitations, that "shifter circuit one of a) shifts, b) sign 
extends, or c) zero extends said first data value prior to loading said first data value into said target 
register during a load byte operation." 

This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 6 depends from claim 1; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. The rejection of this claim 
should be reversed. 

Claim 7 

Claim 7 requires, among other limitations, that the "shifter circuit loads said shifted first data 
value into said target register at the end of three machine cycles." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 7 depends from claim 6, therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner makes no showing in the art of this limitation. Instead, the Examiner 
hypothesizes what latency savings might occur in the proposed combination, without making any 
reference to the teachings of the art itself, and then indicates that the claim limitation is simply a 
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"change in magnitude of latency" over a hypothesis based on a proposed combination. The 
limitation is not taught or suggested by the art of record, alone or in combination, and the Examiner 
cannot make a showing of any actual relevant teaching in the art. 
The rejection of this claim should be reversed. 

Claim 8 

Claim 8 requires, among other limitations, that the "bypass circuitry comprises a multiplexer 
having a first input channel coupled to a data output of said data cache." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 8 depends from claim 1; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. 

As described above, neither reference teaches the claimed bypass circuitry. The Examiner 
does not even attempt to show that this limitation is met, and so fails to make even a prima facie 
obviousness rejection . Instead, the Examiner simply alleges that Hannah has a multiplexer with an 
input channel coupled to "an output of another device". 

This is not the claim limitation. Hannah does not teach or suggest the claim limitation. 
Virtually any multiplexer input is connected to an output of something, but that is not enough to 
teach or suggest the limitations of the claim. 

The rejection of this claim should be reversed. 
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Claim 9 

Claim 9 requires, among other limitations, that the "multiplexer has a second input channel 
coupled to an output of said shifter circuit." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 9 depends from claim 8; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

As described above, neither reference teaches the claimed bypass circuitry. The Examiner 
does not even attempt to show that this limitation is met, and so fails to make even a prima facie 
obviousness rejection . Instead, the Examiner simply alleges that Hannah has a multiplexer with an 
input channel coupled to "an output of another device". 

This is not the claim limitation. Hannah does not teach or suggest the claim limitation. 
Virtually any multiplexer input is connected to an output of something, but that is not enough to 
teach or suggest the limitations of the claim. 

As can be seen, with respect to Claims 8 and 9, the Examiner has not shown any multiplexer 
connected as required, and has not even alleged any teaching corresponding to these claim 
limitations. 

The rejection of this claim should be reversed. 
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Claim 10 

Independent claim 10 describes 

10. A method of loading a first data value from a data cache 
into a target register of a plurality of registers, the method comprising 
the steps of: 

determining if a pending instruction in an N-stage execution 
pipeline is one of a load word operation, a load half-word operation, 
and a load byte operation; 

in response to a determination that the pending instruction is a 
load half-word operation, transferring the first data value from the 
data cache to a shifter circuit and shifting the first data value prior to 
loading the first data value into the target register; 

in response to a determination that the pending instruction is a 
load byte operation, transferring the first data value from the data 
cache to the shifter circuit and shifting the first data value prior to 
loading the first data value into the target register; and 

in response to a determination that the pending instruction is a 
load word operation, transferring the first data value from the data 
cache directly to the target register without processing the first data 
value in the shifter circuit. 
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Claim 10 requires "transferring the first data value from the data cache directly to the target 
register without processing the first data value in the shifter circuit." This limitation is not taught or 
suggested by the art of record. The Examiner alleges that this is taught by Greenly at col. 2, lines 35- 
40, but the Examiner is mistaken: 

The purpose of the alignment unit 170 is to right justify the data 
fetched from the data cache 1 80. For example, when a byte is fetched 
from the data cache 1 80, regardless of its byte position (i.e., 0 through 
7) in the cache, the byte is always right justified in the alignment unit. 
It is clear that the passage on which the Examiner relies does not teach or suggest anything at 
all related to the claim limitation. 

Further, neither Greenley or Hannah teach or suggest doing anything in response to a 
determination that a pending instruction is a load word operation as recited in Claim 10. The 
Examiner's rejection on this point repeats both the combination above and the motivation addressed 
below, neither of which teach or suggest anything at all with respect to this particular limitation. As 
the BPAI has noted recently in cases such as Ex Parte Rinkevich et ah (BPAI 2007-1317), a 
consideration of when processes occur in response to other actions can illustrate a patentable 
distinction. 

The rejection of this claim should be reversed. 
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Claim 11 

Claim 1 1 requires, among other limitations, that "the step of transferring the first data value 
requires two machine cycles during a load word operation." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 1 1 depends from claim 10; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner again relies on Greenley to teach this limitation. Greenley doesn't even the 
direct transfer of the parent claim. This particular transfer operation is not taught or suggested by 
any art of record, and certainly is not taught as being performed during a load word operation, and 
requiring two machine cycles. 

The rejection of this claim should be reversed. 

Claim 12 

Claim 12 requires, among other limitations, that "the step of transferring the first data value 
requires three machine cycles during a load half-word operation." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 12 depends from claim 10; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 



Appeal Brief- Serial No. 09/751,372 



Page 33 



ATTORNEY DOCKET NO. 00-BN-051 (STMI01-00051) 

U.S. Serial No. 09/751,372 
Patent 

The Examiner makes no showing in the art of this limitation. Instead, the Examiner 
hypothesizes what latency savings might occur in the proposed combination, without making any 
reference to the teachings of the art itself, and then indicates that the claim limitation is simply a 
"change in magnitude of latency" over a hypothesis based on a proposed combination. The 
limitation is not taught or suggested by the art of record, alone or in combination, and the Examiner 
cannot make a showing of any actual relevant teaching in the art. 

The rejection of this claim should be reversed. 

Claim 13 

Claim 1 3 requires, among other limitations, that "the step of transferring the first data value 
requires three machine cycles during a load byte operation." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 13 depends from claim 10; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner makes no showing in the art of this limitation. Instead, the Examiner 
hypothesizes what latency savings might occur in the proposed combination, without making any 
reference to the teachings of the art itself, and then indicates that the claim limitation is simply a 
"change in magnitude of latency" over a hypothesis based on a proposed combination. The 
limitation is not taught or suggested by the art of record, alone or in combination, and the Examiner 
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make a showing of any actual relevant teaching in the art. 
The rejection of this claim should be reversed. 

Claim 14 

Independent claim 14 describes 

14. A processing system comprising: 
a data processor comprising: 

an instruction execution pipeline comprising N processing 
stages, each of said N processing stages capable of performing one of 
a plurality of execution steps associated with a pending instruction 
being executed by said instruction execution pipeline; 

a data cache capable of storing data values used by said 
pending instruction; 

a plurality of registers capable of receiving said data values 
from said data cache; 

a load store unit capable of transferring a first one of said data 
values from said data cache to a target one of said plurality of 
registers during execution of a load operation; 

a shifter circuit associated with said load store unit capable of 
one of a) shifting, b) sign extending, or c) zero extending said first 



Appeal Brief- Serial No. 09/751,372 



Page 35 



Attorney Docket No. 00-BN-051 (STMI01-00051) 

U.S. Serial No. 09/751,372 
Patent 

data value prior to loading said first data value into said target 
register; and 

bypass circuitry associated with said load store unit capable of 
transferring said first data value from said data cache directly to said 
target register without processing said first data value in said shifter 
circuit; 

a memory coupled to said data processor; and 
a plurality of memory-mapped peripheral circuits coupled to 
said data processor for performing selected functions in association 
with said data processor. 
Claim 14 requires a "load store unit" capable of transferring a first data value from a "data 
cache" to a "target one of [a] plurality of registers" during execution of a load operation. Claim 14 
also recites a "shifter circuit" capable of shifting, sign extending, or zero extending the first data 
value "prior to loading [the] first data value into [the] target register." In addition, Claim 14 recites 
"bypass circuitry" capable of "transferring [the] first data value from [the] data cache directly to [the] 
target register without processing [the] first data value in [the] shifter circuit." 

Claim 14 is clear that the "shifter circuit" can process a data value before the data value is 
loaded from a data cache into a target register. Also, the "bypass circuitry" can transfer the data 
value from the data cache directly to the target register without the data value being processed by the 
"shifter circuit" (thereby bypassing the shifter circuit). 
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Examiner Li alleges that Greenley teaches 

A shifter circuit (Greenley 160,170 of Fig. 1) associated with said 
load store unit capable of one of a) shifting (Greenley Col.2 lines 19- 
31), b) sign extending (Greenley Col. 2 lines 48-54), and c) zero 
extending (Greenley Col. 2 lines 45-47) said first data value prior to 
loading said first data value into said target register; 
Examiner Li is incorrect. The passage cited by the Examiner is reproduced above with 
relation to claim 1, and does not teach or suggest that which the Examiner alleges. 

Examiner Li claims that sign extension unit 160 and alignment unit 170 together form the 
claimed "shifter circuit." Examiner Li is correct in that sign extension unit 160 does perform sign 
extending, and the alignment unit 170 does perform zero filling and shifting. In fact, Greenley 
teaches that the alignment must be assured, and that alignment unit 170 assures that the appropriate 
bits of the half-words are in the appropriate bit positions. See col. 2 y lines 19-30. It is clear from this 
that, in Greenley' s system, that the "shifter circuit" of 160/170 cannot be bypassed, as those 
functions must be present. 

Examiner Li's position is therefore contradicted by the express recitations of Greenley. 
Regarding the aligning unit, Greenley specifically recites that load access to a data cache "must" 
insure that the accessed data is aligned. (Col. 2, Lines 20-25). Greenley also specifically recites that 
"double words" are "similarly aligned by the aligning unit." Regarding the sign extension unit, 
Greenley specifically recites that "double-words" are "similarly sign extended" by "the sign 
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extension unit." (Col. 2, Lines 60-61), 

It is clear, then, that Greenley does not, and cannot, include "bypass circuitry" as claimed, 
and Greenley fails to disclose, teach, or suggest a structure that allows a data value to be transferred 
either (i) from a data cache to a target register through a shifter circuit, or (ii) directly from the data 
cache to the target register while bypassing the shifter circuit, as conceded in the Office Action. 

In Greenley, all data passes from the data cache 180 through the aligning unit 170 and the 
sign extension unit 160 into the register files 150. Because of this, Greenley recites that all data 
passes through a shifter. More specifically, Greenley recites that all data passes through a shifter (the 
alignment unit 170 and the sign extension unit 160) before the data is stored in a register file 150. 

Examiner Li instead references Hannah, and alleges that "Hannah has taught bypassing 
functions circuitry capable of transferring said first data value to said target without processing said 
first data value in said shifter circuit (Hannah column 9, lines 31-67; Figure 11; Figure 12, Figure 13, 
and Figure 14)." As reproduced above with relation to claim 1, this passage of Hannah teaches 
nothing of the sort. 

At no point in this passage does Hannah teach or suggest bypass circuitry, as claimed, that is 
capable of transferring a first data value from a data cache, as no data cache is shown or described 
here. One cache described by Hannah is read cache 208, shown in Figure 2, which contains mip 
maps used by interpolator 209. Interpolator 209 resamples the texture image to produce the output 
samples, which are sent to the graphics rasterizer. These mip maps are not data values used by the 
pending instructions in an instruction execution pipeline, to transferred into a target register, as 



Appeal Brief- Serial No. 09/751372 



Page 38 



Attorney Docket No. 00-BN-051 (STMI01-00051) 
U.S. Serial No. 09/751,372 
Patent 

required by the claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O 
caches (e.g. 407). 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring a first data value directly to a 
target register, as no registers are shown or described here. Hanna does not include any bypass 
circuitry that transfers any data values from a data cache to a register. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring the first data value without 
processing the first data value in a shifter circuit. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is associated with a load store unit. 

Examiner Li does not even attempt to show where these limitations may be found within 
Hannah or these figures, and fails to allege even a single specific element that could meet the claim 
limitations. Indeed, this passage of Hannah appears to be cited simply because it includes the term 
"bypassed". The particular sentence of Hannah the Examiner reiterates in response is "Since each 
slice handles both a sign bit and its extension, as well as fraction bits, these capabilities are disabled 
or bypassed as appropriate for the position of the slice in the larger word." The Examiner then 
acknowledges that Hannah doesn't actually teach any bypass circuitry, and simply states that 
Hannah's description "suggests" that the multiplexers are used to bypass certain capabilities 

The Examiner does not - and tacitly admits that she cannot - identify any specific circuit in 
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Hannah that is a bypass circuitry associated with a load store unit capable of transferring a first data 
value from a data cache directly to a target register without processing the first data value in a shifter 
circuit, as claimed. To the extent anything in Hannah is bypassed at all, there is no "bypass circuitry" 
connected as claimed or capable of performing the claimed functions. 

Examiner Li alleges that "Hannah has taught bypassing functions circuitry capable of 
transferring said first data value to said target without processing said first data value in said shifter 
circuit." Examiner Li is incorrect — no data cache is shown or described here, as required by the 
claims, and as Examiner Li attempts to ignore by simply referencing a "target". One cache 
described by Hannah is read cache 208, shown in Figure 2, which contains mip maps used by 
interpolator 209. Interpolator 209 resamples the texture image to produce the output samples, which 
are sent to the graphics rasterizer. These mip maps are not data values used by the pending 
instructions in an instruction execution pipeline, to transfer into a target register, as required by the 
claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O caches (e.g. 407). 

Examiner Li therefore fails to make a prima facie obviousness rejection. Even if there were 
motivation to combine these references, the proposed combination still does not include the claim 
limitations. 

Since Greenley does not teach or suggest the claimed features, and Hannah does not teach or 
suggest the claimed features, no combination of these references can teach or suggest the claimed 
features. As these features are found in all independent claims, all claims distinguish over Greenley, 
Hannah, and any combination of them. Greenley, in fact, teaches away from the Examiner's 
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proposed combination and further modification by teaching that the shift functions must be used for 
any data transfer. 

Applicant further notes that the Examiner Li's stated motivation is that "bypasses improve 
the performance of a system by minimizing delays from unnecessary functions (Hannah column 9, 
lines 38-44)". This portion of Hannah is part of the passage reproduced above, and it is clear that 
there is no such teaching in this passage. While Hannah teaches a trade-off between 
precision/resolution and speed/cost, this has nothing to do with any bypass circuitry, despite the 
Examiner's rhetorical bootstrapping. There is no proper motivation in the reference itself, explicitly 
or implicitly, nor in the knowledge of one of ordinary skill in the art, nor in the problem to be solved. 

The rejection of this claim should be reversed. 

Claim 15 

Claim 1 5 requires, among other limitations, that "said bypass circuitry transfers said first data 
value from said data cache directly to said target register during a load word operation." 

This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 15 depends from claim 14; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. 

Examiner Li does not make any attempt to show this limitation in the art , and so fails to make 
a prima facie rejection. The Examiner simply asserts that "Greenley's loading of a double word has 
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the same consequences as applicant's loading of a word." Even if this were true, it is irrelevant -the 
Examiner is required to show that the claim limitation itself is taught in some reference, and the 
Examiner cannot. 

As Greenley doesn't even include bypass circuitry, as the Examiner admits, the Examiner's 
reliance on Greeley for a limitation relating to the operation of the bypass circuitry is particularly 
empty. This particular transfer operation is not taught or suggested by any art of record, and certainly 
is not taught as being performed during a load word operation. 

The rejection of this claim should be reversed. 

Claim 16 

Claim 16 requires, among other limitations, that "bypass circuitry transfers said first data 
value from said data cache directly to said target register at the end of two machine cycles." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 16 depends from claim 15; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner again relies on Greenley to teach this limitation. As Greenley doesn't even 
include bypass circuitry, as the Examiner admits, the Examiner's reliance on Greeley for a limitation 
relating to the operation of the bypass circuitry is particularly empty. This particular transfer 
operation is not taught or suggested by any art of record, and certainly is not taught as being 



Appeal Brief- Serial No. 09/751,372 



Page 42 



Attorney Docket No. 00-BN-051 (STMI01-00051) 
U.S. Serial No. 09/751,372 
Patent 

performed during a load word operation, and at the end of two machine cycles. 
The rejection of this claim should be reversed. 

Claim 17 

Claim 17 requires, among other limitations, that "shifter circuit one of a) shifts, b) sign 
extends, or c) zero extends said first data value prior to loading said first data value into said target 
register during a load half-word operation." 

This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 17 depends from claim 14; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. The rejection of this claim 
should be reversed. 

Claim 18 

Claim 1 8 requires, among other limitations, that the "shifter circuit loads said shifted first 
data value into said target register at the end of three machine cycles." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 1 8 depends from claim 17; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 
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The Examiner makes no showing in the art of this limitation. Instead, the Examiner 
hypothesizes what latency savings might occur in the proposed combination, without making any 
reference to the teachings of the art itself, and then indicates that the claim limitation is simply a 
"change in magnitude of latency" over a hypothesis based on a proposed combination. The 
limitation is not taught or suggested by the art of record, alone or in combination, and the Examiner 
cannot make a showing of any actual relevant teaching in the art. 

The rejection of this claim should be reversed. 

Claim 19 

Claim 19 requires, among other limitations, that "shifter circuit one of a) shifts, b) sign 
extends, or c) zero extends said first data value prior to loading said first data value into said target 
register during a load byte operation." 

This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 19 depends from claim 14; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. The rejection of this claim 
should be reversed. 

Claim 20 

Claim 20 requires, among other limitations, that the "shifter circuit loads said shifted first 
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data value into said target register at the end of three machine cycles." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 20 depends from claim 19, therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner makes no showing in the art of this limitation. Instead, the Examiner 
hypothesizes what latency savings might occur in the proposed combination, without making any 
reference to the teachings of the art itself, and then indicates that the claim limitation is simply a 
"change in magnitude of latency" over a hypothesis based on a proposed combination. The 
limitation is not taught or suggested by the art of record, alone or in combination, and the Examiner 
cannot make a showing of any actual relevant teaching in the art. 

The rejection of this claim should be reversed. 

Claim 21 

Claim 2 1 requires, among other limitations, that the "bypass circuitry comprises a multiplexer 
having a first input channel coupled to a data output of said data cache." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 21 depends from claim 14; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. 
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As described above, neither reference teaches the claimed bypass circuitry. The Examiner 
does not even attempt to show that this limitation is met, and so fails to make even a prima facie 
obviousness rejection . Instead, the Examiner simply alleges that Hannah has a multiplexer with an 
input channel coupled to "an output of another device". 

This is not the claim limitation. Hannah does not teach or suggest the claim limitation. 
Virtually any multiplexer input is connected to an output of something, but that is not enough to 
teach or suggest the limitations of the claim. 

The rejection of this claim should be reversed. 

Claim 22 

Claim 22 requires, among other limitations, that the "multiplexer has a second input channel 
coupled to an output of said shifter circuit." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 22 depends from claim 21 ; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

As described above, neither reference teaches the claimed bypass circuitry. The Examiner 
does not even attempt to show that this limitation is met, and so fails to make even a prima facie 
obviousness rejection . Instead, the Examiner simply alleges that Hannah has a multiplexer with an 
input channel coupled to "an output of another device". 
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This is not the claim limitation. Hannah does not teach or suggest the claim limitation. 
Virtually any multiplexer input is connected to an output of something, but that is not enough to 
teach or suggest the limitations of the claim. 

As can be seen, with respect to Claims 21 and 22, the Examiner has not shown any 
multiplexer connected as required, and has not even alleged any teaching corresponding to these 
claim limitations. 

The rejection of this claim should be reversed. 

Claim 23 

Independent claim 23 requires: 

23. A processor, comprising: 
a cache; 

a plurality of registers; 

a shifter circuit capable of shifting, sign extending, or zero 
extending a data value from the cache and providing a modified data 
value to a target one of the registers; and 

a bypass circuit capable of transferring the data value from the 
cache to the target register without processing the data value in the 
shifter circuit. 

Claim 23 recites a "shifter circuit" capable of shifting, sign extending, or zero extending a 
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data value from the cache and providing a modified data value to a target one of the registers. In 
addition, Claim 23 recites a bypass circuit capable of transferring the data value from the cache to the 
target register without processing the data value in the shifter circuit. 

Claim 23 is clear that "bypass circuit" can transfer the data value from the data cache directly 
to the target register without the data value being processed by the "shifter circuit" (thereby 
bypassing the shifter circuit). 

Examiner Li alleges that Greenley teaches 

A shifter circuit (Greenley 160,170 of Fig. 1) associated with said 
load store unit capable of one of a) shifting (Greenley Col. 2 lines 19- 
31), b) sign extending (Greenley Col. 2 lines 48-54), and c) zero 
extending (Greenley Col. 2 lines 45-47) said first data value prior to 
loading said first data value into said target register; 
Examiner Li is incorrect. The passage cited by the Examiner is reproduced above with 
relation to claim 1, and does not teach or suggest that which the Examiner alleges. 

Examiner Li claims that sign extension unit 160 and alignment unit 170 together form the 
claimed "shifter circuit." Examiner Li is correct in that sign extension unit 160 does perform sign 
extending, and the alignment unit 170 does perform zero filling and shifting. In fact, Greenley 
teaches that the alignment must be assured, and that alignment unit 170 assures that the appropriate 
bits of the half-words are in the appropriate bit positions. See col. 2, lines 19-30. It is clear from this 
that, in Greenley's system, that the "shifter circuit" of 160/170 cannot be bypassed, as those 
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functions must be present. 

Examiner Li's position is therefore contradicted by the express recitations of Greenley. 
Regarding the aligning unit, Greenley specifically recites that load access to a data cache "must" 
insure that the accessed data is aligned. {Col 2, Lines 20-25). Greenley also specifically recites that 
"double words" are "similarly aligned by the aligning unit." Regarding the sign extension unit, 
Greenley specifically recites that "double-words" are "similarly sign extended" by "the sign 
extension unit." {Col 2, Lines 60-61). 

It is clear, then, that Greenley does not, and cannot, include "bypass circuitry" as claimed, 
and Greenley fails to disclose, teach, or suggest a structure that allows a data value to be transferred 
either (i) from a data cache to a target register through a shifter circuit, or (ii) directly from the data 
cache to the target register while bypassing the shifter circuit, as conceded in the Office Action. 

In Greenley, all data passes from the data cache 180 through the aligning unit 170 and the 
sign extension unit 160 into the register files 150. Because of this, Greenley recites that all data 
passes through a shifter. More specifically, Greenley recites that all data passes through a shifter (the 
alignment unit 170 and the sign extension unit 160) before the data is stored in a register file 150. 

Examiner Li instead references Hannah, and alleges that "Hannah has taught bypassing 
functions circuitry capable of transferring said first data value to said target without processing said 
first data value in said shifter circuit (Hannah column9, lines 31-67; Figure ll;Figure 12, Figure 13, 
and Figure 14)." As reproduced above with relation to claim 1, this passage of Hannah teaches 
nothing of the sort. 
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At no point in this passage does Hannah teach or suggest bypass circuitry, as claimed, that is 
capable of transferring a first data value from a data cache, as no data cache is shown or described 
here. One cache described by Hannah is read cache 208, shown in Figure 2, which contains mip 
maps used by interpolator 209. Interpolator 209 resamples the texture image to produce the output 
samples, which are sent to the graphics rasterizer. These mip maps are not data values used by the 
pending instructions in an instruction execution pipeline, to transferred into a target register, as 
required by the claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O 
caches (e.g. 407). 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring a first data value directly to a 
target register, as no registers are shown or described here. Hanna does not include any bypass 
circuitry that transfers any data values from a data cache to a register. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring the first data value without 
processing the first data value in a shifter circuit. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is associated with a load store unit. 

Examiner Li does not even attempt to show where these limitations may be found within 
Hannah or these figures, and fails to allege even a single specific element that could meet the claim 
limitations. Indeed, this passage of Hannah appears to be cited simply because it includes the term 



Appeal Brief- Serial No. 09/751,372 



Page 50 



Attorney Docket No. 00-BN-051 (STMI01-00051) 
U.S. Serial No. 09/751,372 
Patent 

"bypassed". The particular sentence of Hannah the Examiner reiterates in response is "Since each 
slice handles both a sign bit and its extension, as well as fraction bits, these capabilities are disabled 
or bypassed as appropriate for the position of the slice in the larger word." The Examiner then 
acknowledges that Hannah doesn't actually teach any bypass circuitry, and simply states that 
Hannah's description "suggests" that the multiplexers are used to bypass certain capabilities 

The Examiner does not - and tacitly admits that she cannot - identify any specific circuit in 
Hannah that is a bypass circuitry associated with a load store unit capable of transferring a first data 
value from a data cache directly to a target register without processing the first data value in a shifter 
circuit, as claimed. To the extent anything in Hannah is bypassed at all, there is no "bypass circuitry" 
connected as claimed or capable of performing the claimed functions. 

Examiner Li alleges that "Hannah has taught bypassing functions circuitry capable of 
transferring said first data value to said target without processing said first data value in said shifter 
circuit." Examiner Li is incorrect — no data cache is shown or described here, as required by the 
claims, and as Examiner Li attempts to ignore by simply referencing a "target". One cache 
described by Hannah is read cache 208, shown in Figure 2, which contains mip maps used by 
interpolator 209. Interpolator 209 resamples the texture image to produce the output samples, which 
are sent to the graphics rasterizer. These mip maps are not data values used by the pending 
instructions in an instruction execution pipeline, to transfer into a target register, as required by the 
claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O caches (e.g. 407). 

Examiner Li therefore fails to make a prima facie obviousness rejection. Even if there were 
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motivation to combine these references, the proposed combination still does not include the claim 
limitations. 

Since Greenley does not teach or suggest the claimed features, and Hannah does not teach or 
suggest the claimed features, no combination of these references can teach or suggest the claimed 
features. As these features are found in all independent claims, all claims distinguish over Greenley, 
Hannah, and any combination of them. Greenley, in fact, teaches away from the Examiner's 
proposed combination and further modification by teaching that the shift functions must be used for 
any data transfer. 

Applicant further notes that the Examiner Li's stated motivation is that "bypasses improve 
the performance of a system by minimizing delays from unnecessary functions (Hannah column 9, 
lines 38-44)". This portion of Hannah is part of the passage reproduced above, and it is clear that 
there is no such teaching in this passage. While Hannah teaches a trade-off between 
precision/resolution and speed/cost, this has nothing to do with any bypass circuitry, despite the 
Examiner's rhetorical bootstrapping. There is no proper motivation in the reference itself, explicitly 
or implicitly, nor in the knowledge of one of ordinary skill in the art, nor in the problem to be solved. 

The rejection of this claim should be reversed. 

Claim 24 

Claim 24 requires, among other limitations, that "the data value is transferred from the cache 
to the target register via the bypass circuit during a load word operation." 
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This limitation, in combination with the limitations of the parent claim, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 24 depends from claim 23; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. 

Examiner Li does not make any attempt to show this limitation in the art , and so fails to make 
a prima facie rejection. The Examiner simply asserts that "Greenley's loading of a double word has 
the same consequences as applicant's loading of a word." Even if this were true, it is irrelevant -the 
Examiner is required to show that the claim limitation itself is taught in some reference, and the 
Examiner cannot. 

As Greenley doesn't even include bypass circuitry, as the Examiner admits, the Examiner's 
reliance on Greeley for a limitation relating to the operation of the bypass circuitry is particularly 
empty. This particular transfer operation is not taught or suggested by any art of record, and certainly 
is not taught as being performed during a load word operation. 

The rejection of this claim should be reversed. 

Claim 25 

Claim 25 requires, among other limitations, that "the bypass circuit is capable of transferring 
the data value from the cache to the target register at an end of two machine cycles; and the shifter 
circuit is capable of providing the modified data value to the target register at an end of three 
machine cycles." 
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This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 25 depends from claim 24; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

The Examiner again relies on Greenley to teach this limitation. As Greenley doesn't even 
include bypass circuitry, as the Examiner admits, the Examiner's reliance on Greeley for a limitation 
relating to the operation of the bypass circuitry is particularly empty. This particular transfer 
operation is not taught or suggested by any art of record, and certainly is not taught as being 
performed during a load word operation, and at the end of two machine cycles. 

The rejection of this claim should be reversed. 

Claim 26 

Claim 26 requires, among other limitations, that the "bypass circuit comprises a multiplexer 
having a first input coupled to the cache and a second input coupled to the shifter circuit." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 26 depends from claim 23; therefore the arguments above with respect to the parent 
claim apply here as well, and are hereby incorporated by reference. 

As described above, neither reference teaches the claimed bypass circuitry. The Examiner 
simply makes general reference to a passage of Hannah reproduced above and refers to several of 
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Hannah's figures. None of these depicts the claimed bypass circuitry. The Examiner completely 
fails to show any multiplexer in Hannah connected as claimed, and doesn't bother to identify any 
specific element at all. 

The rejection of this claim should be reversed. 

Claim 27 

Independent claim 27 provides: 

27. A method, comprising: 

shifting, sign extending, or zero extending a first data value 
from a cache and providing a modified first data value to a first of a 
plurality of registers; and 

transferring a second data value from the cache to a second of 
the plurality of registers without shifting, sign extending, or zero 
extending the second data value. 
Claim 27 requires transferring a second data value from the cache to a second of the plurality 
of registers without shifting, sign extending, or zero extending the second data value." This 
limitation is not taught or suggested by the art of record. The Examiner alleges that this is taught by 
Greenley, but the Examiner is mistaken. 

It is clear that the passages on which the Examiner relies do not teach or suggest anything at 
all related to the claim limitation. 
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The rejection of this claim should be reversed. 
Claim 28 

Claim 28 requires, among other limitations, "transferring the second data value comprises 
transferring the second data value to the second register in response to determining that a second 
pending instruction in the processor is a load word operation." 

This limitation, in combination with the limitations of the parent claims, is not taught or 
suggested by any art of record, alone or in combination. 

Claim 28 depends from claim 27; therefore the arguments above with respect to the parent 
claims apply here as well, and are hereby incorporated by reference. 

Neither Greenley or Hannah teach or suggest doing anything in response to determining that a 
second pending instruction in the processor is a load word operation as recited in Claim 28. The 
Examiner's rejection on this point repeats both the combination above and the motivation addressed 
below, neither of which teach or suggest anything at all with respect to this particular limitation. As 
the BPAI has noted recently, a consideration of when processes occur in response to other actions 
can illustrate a patentable distinction. 

The rejection of this claim should be reversed. 
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Claim 29 

Independent Claim 29 describes: 
29. A system, comprising: 

a processor comprising: 
a cache; 

a plurality of registers; 

a shifter circuit capable of shifting, sign extending, or 
zero extending a data value from the cache and providing a modified 
data value to a target one of the registers; and 

a bypass circuit capable of transferring the data value 
from the cache to the target register without processing the data value 
in the shifter circuit; 

a memory coupled to the processor; and 
a plurality of peripheral circuits capable of performing 
selected functions in association with the processor. 
Claim 29 recites a "shifter circuit" capable of shifting, sign extending, or zero extending a 
data value from the cache and providing a modified data value to a target one of the registers. In 
addition, Claim 29 recites a bypass circuit capable of transferring the data value from the cache to the 
target register without processing the data value in the shifter circuit. 

Claim 29 is clear that "bypass circuit" can transfer the data value from the data cache directly 
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to the target register without the data value being processed by the "shifter circuit" (thereby 
bypassing the shifter circuit). 

Examiner Li alleges that Greenley teaches 

A shifter circuit (Greenley 160,170 of Fig. 1) associated with said 
load store unit capable of one of a) shifting (Greenley Col. 2 lines 19- 
31), b) sign extending (Greenley Col. 2 lines 48-54), and c) zero 
extending (Greenley Col. 2 lines 45-47) said first data value prior to 
loading said first data value into said target register; 
Examiner Li is incorrect. The passage cited by the Examiner is reproduced above with 
relation to claim 1, and does not teach or suggest that which the Examiner alleges. 

Examiner Li claims that sign extension unit 160 and alignment unit 170 together form the 
claimed "shifter circuit." Examiner Li is correct in that sign extension unit 160 does perform sign 
extending, and the alignment unit 170 does perform zero filling and shifting. In fact, Greenley 
teaches that the alignment must be assured, and that alignment unit 170 assures that the appropriate 
bits of the half-words are in the appropriate bit positions. See col. 2, lines 19-30. It is clear from this 
that, in Greenley's system, that the "shifter circuit" of 160/170 cannot be bypassed, as those 
functions must be present. 

Examiner Li's position is therefore contradicted by the express recitations of Greenley. 
Regarding the aligning unit, Greenley specifically recites that load access to a data cache "must" 
insure that the accessed data is aligned. {Col 2, Lines 20-25). Greenley also specifically recites that 
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"double words" are "similarly aligned by the aligning unit." Regarding the sign extension unit, 
Greenley specifically recites that "double-words" are "similarly sign extended" by "the sign 
extension unit." (Col. 2, Lines 60-61). 

It is clear, then, that Greenley does not, and cannot, include "bypass circuitry" as claimed, 
and Greenley fails to disclose, teach, or suggest a structure that allows a data value to be transferred 
either (i) from a data cache to a target register through a shifter circuit, or (ii) directly from the data 
cache to the target register while bypassing the shifter circuit, as conceded in the Office Action. 

In Greenley, all data passes from the data cache 180 through the aligning unit 170 and the 
sign extension unit 160 into the register files 150. Because of this, Greenley recites that all data 
passes through a shifter. More specifically, Greenley recites that all data passes through a shifter (the 
alignment unit 170 and the sign extension unit 160) before the data is stored in a register file 150. 

Examiner Li instead references Hannah, and alleges that "Hannah has taught bypassing 
functions circuitry capable of transferring said first data value to said target without processing said 
first data value in said shifter circuit (Hannah column 9, lines 3 1-67; Figure 11; Figure 12, Figure 13, 
and Figure 14)." As reproduced above with relation to claim 1, this passage of Hannah teaches 
nothing of the sort. 

At no point in this passage does Hannah teach or suggest bypass circuitry, as claimed, that is 
capable of transferring a first data value from a data cache, as no data cache is shown or described 
here. One cache described by Hannah is read cache 208, shown in Figure 2, which contains mip 
maps used by interpolator 209. Interpolator 209 resamples the texture image to produce the output 
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samples, which are sent to the graphics rasterizer. These mip maps are not data values used by the 
pending instructions in an instruction execution pipeline, to transferred into a target register, as 
required by the claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O 
caches (e.g. 407). 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring a first data value directly to a 
target register, as no registers are shown or described here. Hanna does not include any bypass 
circuitry that transfers any data values from a data cache to a register. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is capable of transferring the first data value without 
processing the first data value in a shifter circuit. 

As is clear, at no point in this passage, or anywhere else in Hannah, does Hannah teach or 
suggest bypass circuitry, as claimed, that is associated with a load store unit. 

Examiner Li does not even attempt to show where these limitations may be found within 
Hannah or these figures, and fails to allege even a single specific element that could meet the claim 
limitations. Indeed, this passage of Hannah appears to be cited simply because it includes the term 
"bypassed". The particular sentence of Hannah the Examiner reiterates in response is "Since each 
slice handles both a sign bit and its extension, as well as fraction bits, these capabilities are disabled 
or bypassed as appropriate for the position of the slice in the larger word." The Examiner then 
acknowledges that Hannah doesn't actually teach any bypass circuitry, and simply states that 
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Hannah's description "suggests" that the multiplexers are used to bypass certain capabilities 

The Examiner does not - and tacitly admits that she cannot - identify any specific circuit in 
Hannah that is a bypass circuitry associated with a load store unit capable of transferring a first data 
value from a data cache directly to a target register without processing the first data value in a shifter 
circuit, as claimed. To the extent anything in Hannah is bypassed at all, there is no "bypass circuitry" 
connected as claimed or capable of performing the claimed functions. 

Examiner Li alleges that "Hannah has taught bypassing functions circuitry capable of 
transferring said first data value to said target without processing said first data value in said shifter 
circuit." Examiner Li is incorrect — no data cache is shown or described here, as required by the 
claims, and as Examiner Li attempts to ignore by simply referencing a "target". One cache 
described by Hannah is read cache 208, shown in Figure 2, which contains mip maps used by 
interpolator 209. Interpolator 209 resamples the texture image to produce the output samples, which 
are sent to the graphics rasterizer. These mip maps are not data values used by the pending 
instructions in an instruction execution pipeline, to transfer into a target register, as required by the 
claims. Nor do the TRAM caches meet the limitations of the claims, nor the I/O caches (e.g. 407). 

Examiner Li therefore fails to make a prima facie obviousness rejection. Even if there were 
motivation to combine these references, the proposed combination still does not include the claim 
limitations. 

Since Greenley does not teach or suggest the claimed features, and Hannah does not teach or 
suggest the claimed features, no combination of these references can teach or suggest the claimed 
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features. As these features are found in all independent claims, all claims distinguish over Greenley, 
Hannah, and any combination of them. Greenley, in fact, teaches away from the Examiner's 
proposed combination and further modification by teaching that the shift functions must be used for 
any data transfer. 

Applicant further notes that the Examiner Li's stated motivation is that "bypasses improve 
the performance of a system by minimizing delays from unnecessary functions (Hannah column 9, 
lines 38-44)". This portion of Hannah is part of the passage reproduced above, and it is clear that 
there is no such teaching in this passage. While Hannah teaches a trade-off between 
precision/resolution and speed/cost, this has nothing to do with any bypass circuitry, despite the 
Examiner's rhetorical bootstrapping. There is no proper motivation in the reference itself, explicitly 
or implicitly, nor in the knowledge of one of ordinary skill in the art, nor in the problem to be solved. 

The rejection of this claim should be reversed. 

All rejections are traversed, and should be reversed. 

Applicant further notes that the Examiner's stated motivation is that "bypasses improve the 
performance of a system by minimizing delays from unnecessary functions (Hannah column 9, lines 
38-44)". This portion of Hannah is part of a passage reproduced above, and it is clear that there is no 
such teaching in this passage. While Hannah teaches a trade-off between precision/resolution and 
speed/cost, this has nothing to do with any bypass circuitry, despite the Examiner's rhetorical 
bootstrapping. 

Other distinctions remain, but need not be addressed at this time, as all claims are shown to 
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distinguish over the art of record, alone or in combination. 
Grouping of Claims 

The claims on appeal do not stand or fall together, as may be seen from the arguments set 
forth below. Each claim has been argued separately under a separate subheading, and each claim 
should be considered separately. While the applicant recognizes that a formal statement regarding 
the grouping of claims is no longer required, each claim should be considered separately; or at the 
very least each claim which is argued separately in the preceding sections of this brief should be 
considered separately. Argument: The fact that the claims use different formulations (as detailed 
above) and/or have been argued separately, shows that, if their patentability is not considered 
separately, any adverse decision would show that the limitations of some claims had been unfairly 
ignored. 
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REQUESTED RELIEF 

The Board is respectfully requested to reverse the outstanding rejections and return this 

application to the Examiner for allowance. 



Respectfully submitted, 
MUNCK BUTRUS, P.C. 



Date: (? -^^ QO^ 

P.O. Drawer 800889 

Dallas, Texas 75380 

Phone: (972) 628-3600 

Fax: (972)628-3616 

E-mail : dvenglarik@munckbutrus. com 




E. Venglari} 
Registration No. $,409 



Attorney for Applicant 



Appeal Brief- Serial No. 09/751,372 



Page 64 



1 w 



DOCKET NO.: 00-BN-051 (STMI01-00051) 
Customer No. 30425 



Attorney Docket No. 00-BN-051 (STMI01-00051) 
U.S. Serial No. 09/751,372 
Patent 

PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re application of: 
Serial No.: 
Filed: 
For: 

Group No.: 
Examiner: 



Anthony X. Jarvis, et al. 
09/751,372 
December 29, 2000 

SYSTEM AND METHOD FOR EXECUTING 
VARIABLE LATENCY LOAD OPERATIONS IN A 
DATA PROCESSOR 

2183 

Aimee J. Li 



APPENDIX A - 
Text of Claims on Appeal 



Appeal Brief- Serial No. 09/751,372 



Appendix A, Page 1 



Attorney Docket No. OO-BN-051 (STMI01-00051) 

U.S. Serial No. 09/751,372 
Patent 

1 . (Previously Presented) A data processor comprising: 

an instruction execution pipeline comprising N processing stages, each of said N processing 
stages capable of performing one of a plurality of execution steps associated with a pending 
instruction being executed by said instruction execution pipeline; 

a data cache capable of storing data values used by said pending instruction; 

a plurality of registers capable of receiving said data values from said data cache; 

a load store unit capable of transferring a first one of said data values from said data cache to 
a target one of said plurality of registers during execution of a load operation; 

a shifter circuit associated with said load store unit capable of one of a) shifting, b) sign 
extending, or c) zero extending said first data value prior to loading said first data value into said 
target register; and 

bypass circuitry associated with said load store unit capable of transferring said first data 
value from said data cache directly to said target register without processing said first data value in 
said shifter circuit. 

2. (Original) The data processor as set forth in Claim 1 wherein said bypass circuitry 
transfers said first data value from said data cache directly to said target register during a load word 
operation. 
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3 . (Original) The data processor as set forth in Claim 2 wherein said bypass circuitry 
transfers said first data value from said data cache directly to said target register at the end of two 
machine cycles. 

4. (Previously Presented) The data processor as set forth in Claim 1 wherein said 
shifter circuit one of a) shifts, b) sign extends, or c) zero extends said first data value prior to loading 
said first data value into said target register during a load half-word operation. 

5 . (Original) The data processor as set forth in Claim 4 wherein said shifter circuit 
loads said shifted first data value into said target register at the end of three machine cycles. 

6. (Previously Presented) The data processor as set forth in Claim 1 wherein said 
shifter circuit one of a) shifts, b) sign extends, or c) zero extends said first data value prior to loading 
said first data value into said target register during a load byte operation. 

7. (Original) The data processor as set forth in Claim 6 wherein said shifter circuit 
loads said shifted first data value into said target register at the end of three machine cycles. 

8 . (Original) The data processor as set forth in Claim 1 wherein said bypass circuitry 
comprises a multiplexer having a first input channel coupled to a data output of said data cache. 
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9 . (Original) The data processor as set forth in Claim 8 wherein said multiplexer has 
a second input channel coupled to an output of said shifter circuit. 

1 0. (Previously Presented) A method of loading a first data value from a data cache 
into a target register of a plurality of registers, the method comprising the steps of: 

determining if a pending instruction in an N-stage execution pipeline is one of a load word 
operation, a load half-word operation, and a load byte operation; 

in response to a determination that the pending instruction is a load half-word operation, 
transferring the first data value from the data cache to a shifter circuit and shifting the first data value 
prior to loading the first data value into the target register; 

in response to a determination that the pending instruction is a load byte operation, 
transferring the first data value from the data cache to the shifter circuit and shifting the first data 
value prior to loading the first data value into the target register; and 

in response to a determination that the pending instruction is a load word operation, 
transferring the first data value from the data cache directly to the target register without processing 
the first data value in the shifter circuit. 

1 1 . (Original) The method as set forth in Claim 1 0 wherein the step of transferring 
the first data value requires two machine cycles during a load word operation. 
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12. (Original) The method as set forth in Claim 10 wherein the step of transferring 
the first data value requires three machine cycles during a load half-word operation. 

13. (Original) The method as set forth in Claim 10 wherein the step of transferring 
the first data value requires three machine cycles during a load byte operation. 

14. (Previously Presented) A processing system comprising: 
a data processor comprising: 

an instruction execution pipeline comprising N processing stages, each of said N 
processing stages capable of performing one of a plurality of execution steps associated with a 
pending instruction being executed by said instruction execution pipeline; 

a data cache capable of storing data values used by said pending instruction; 

a plurality of registers capable of receiving said data values from said data cache; 

a load store unit capable of transferring a first one of said data values from said data 
cache to a target one of said plurality of registers during execution of a load operation; 

a shifter circuit associated with said load store unit capable of one of a) shifting, b) 
sign extending, or c) zero extending said first data value prior to loading said first data value into 
said target register; and 

bypass circuitry associated with said load store unit capable of transferring said first 
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data value from said data cache directly to said target register without processing said first data value 
in said shifter circuit; 

a memory coupled to said data processor; and 

a plurality of memory-mapped peripheral circuits coupled to said data processor for 
performing selected functions in association with said data processor. 

15. (Original) The processing system as set forth in Claim 14 wherein said bypass 
circuitry transfers said first data value from said data cache directly to said target register during a 
load word operation. 

16. (Original) The processing system as set forth in Claim 15 wherein said bypass 
circuitry transfers said first data value from said data cache directly to said target register at the end 
of two machine cycles. 

1 7. (Previously Presented) The processing system as set forth in Claim 14 wherein 
said shifter circuit one of a) shifts, b) sign extends, or c) zero extends said first data value prior to 
loading said first data value into said target register during a load half-word operation. 

18. (Original) The processing system as set forth in Claim 17 wherein said shifter 
circuit loads said shifted first data value into said target register at the end of three machine cycles. 
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1 9. (Previously Presented) The processing system as set forth in Claim 14 wherein 
said shifter circuit one of a) shifts, b) sign extends, or c) zero extends said first data value prior to 
loading said first data value into said target register during a load byte operation. 

20. (Original) The processing system as set forth in Claim 19 wherein said shifter 
circuit loads said shifted first data value into said target register at the end of three machine cycles. 

21 . (Original) The processing system as set forth in Claim 14 wherein said bypass 
circuitry comprises a multiplexer having a first input channel coupled to a data output of said data 
cache. 

22. (Original) The processing system as set forth in Claim 21 wherein said 
multiplexer has a second input channel coupled to an output of said shifter circuit. 

23. (Previously Presented) A processor, comprising: 
a cache; 

a plurality of registers; 

a shifter circuit capable of shifting, sign extending, or zero extending a data value from the 
cache and providing a modified data value to a target one of the registers; and 
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a bypass circuit capable of transferring the data value from the cache to the target register 
without processing the data value in the shifter circuit. 

24. (Previously Presented) The processor of Claim 23, wherein: 

the data value is transferred from the cache to the target register via the bypass circuit during 
a load word operation; and 

the data value is transferred from the cache to the target register via the shifter circuit during a 
load half-word operation or a load byte operation. 

25. (Previously Presented) The processor of Claim 24, wherein: 

the bypass circuit is capable of transferring the data value from the cache to the target register 
at an end of two machine cycles; and 

the shifter circuit is capable of providing the modified data value to the target register at an 
end of three machine cycles. 

26. (Previously Presented) The processor of Claim 23, wherein the bypass circuit 
comprises a multiplexer having a first input coupled to the cache and a second input coupled to the 
shifter circuit. 

27. (Previously Presented) A method, comprising: 
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shifting, sign extending, or zero extending a first data value from a cache and providing a 
modified first data value to a first of a plurality of registers; and 

transferring a second data value from the cache to a second of the plurality of registers 
without shifting, sign extending, or zero extending the second data value. 

28. (Previously Presented) The method of Claim 27, wherein: 

shifting, sign extending, or zero extending the first data value comprises shifting, sign 
extending, or zero extending the first data value in response to determining that a first pending 
instruction in a processor is a load byte operation or a load half-word operation; and 

transferring the second data value comprises transferring the second data value to the second 
register in response to determining that a second pending instruction in the processor is a load word 
operation. 

29. (Previously Presented) A system, comprising: 
a processor comprising: 

a cache; 

a plurality of registers; 

a shifter circuit capable of shifting, sign extending, or zero extending a data value 
from the cache and providing a modified data value to a target one of the registers; and 

a bypass circuit capable of transferring the data value from the cache to the target 
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register without processing the data value in the shifter circuit; 
a memory coupled to the processor; and 

a plurality of peripheral circuits capable of performing selected functions in association with 
the processor. 
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Not Applicable - No other evidence was entered. 
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Not Applicable - To the best knowledge and belief of the undersigned attorney, there are none. 
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